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(54) ENZYME ENDOGLUCANASE AND CELLULASE PREPARATIONS CONTAINING THE SAME 

(57) A highly active cellutase which is to be prefera- 
bly employed in eliminating fluffs from cellulose-contain- 
ing fibers, denier reduction and bleaching denim-dyed 
cellulose fibers; and a gene thereof. This novel cellulase 
NCE4 isolated from Humicofa insolens is a highly active 
one usable in treating various cellulose-containing fib- 
ers. 
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Description 

[Background of the Invention] 
s Field of the Invention 

[0001] The present invention relates to cellulase and a cellulase preparation containing the same, as well as a 
removal of nap of cellulose-containing fibers, a process for reducing cellulose-containing fibers and a process for 
decoloring denim-dyed cellulose-containing fibers by using the cellulase or cellulase preparation. 

w 

Background Art 

[0002] Cellulose-containing fibers are treated with cellulase to give desired properties to those fibers. For example, 
in the fiber industry, treatment is performed using cellulase in order to improve the texture and appearance of cellulose- 

15 containing f foers, and to give the appearance of being stonewashed to denim-dyed cellulose-containing fibers. 

[0003] In addition, Tencel, a reproduced cellulose fiber that is spun wood pulp-based cellulose dissolved in an organic 
solvent, has attracted attention in recent years due to properties such as high strength, water absorbing capacity and 
the like, and its manufacturing method which causes very little environmental contamination. However, since nap is pro- 
duced during the manufacturing process of Tencel, Tencel has low product value as a fiber if left as is. Therefore, meth- 

20 ods for removing nap formed during the manufacturing process by using cellulase have been proposed. 

[0004] At present, cellulases primarily originating in Trichoderma species and Humicola species of wood-rotting fungi 
are used for treatment of cellulose-containing fibers. However, a large amount of cellulase is required under the present 
conditions to demonstrate the desired effect on the fibers. 

[0005] It is likely that the cost of this processing would be able to be reduced, if it were possible to be able to improve 
25 the texture and appearance of cellulose-containing fibers, give the appearance of being stone-washed to denim-dyed 
cellulose-containing fibers, and remove nap from Tencel by using highly active cellulase in smaller amount than the con- 
ventional amount. 

[0006] Furthermore, an example of a cellulase originating in Humicola, it is shown in WO 91/1 7243 (Japanese Patent 
Laid-Open Publication No. 5-509223) that the endoglucanase gene of 43 kD was isolated from Humicola insolens 
30 DSM1800 strain, and that its base sequence was determined. 

[Summary of the Invention] 

[0007] The inventors of the present invention have now isolated from Humicola insolens a novel highly active cellulase 
35 and its gene, which can be used extremely advantageously in the treatment of various cellulose-containing fibers. The 
present invention is based on those findings. 

[0008] Thus, an object of the present invention is to provide a novel cellulase and its gene. 
[0009] Moreover, another object of the present invention is to provide a process for removing nap of cellulose-con- 
taining fibers using a novel cellulase, a process for reducing cellulose-containing fibers and a process for decoloring 
40 denim-dyed, cellulose-containing fibers. 

[001 0] The novel cellulase according to the present invention is a protein or a modified protein having a portion of an 
amino acid sequence shown in SEQ ID No. 1 or a sequence from position 1 to position 284 of the amino acid sequence 
shown in SEQ ID No. 1. 

[0011] In addition, according to the present invention, there is provided a process for removing nap of cellulose-con- 
45 taining fibers, a process for reducing cellulose-containing fibers, and a process for decoloring denim-dyed, cellulose- 
containing fibers comprising the step of contacting the protein or modified protein of the present invention With the 
denim-dyed, cellulose-containing fibers. 

[Detailed Description of the Invention] 

50 

Microorganism Deposition 

[0012] E. coli JM109 strain transformed by plasmid pNCE4Sal (see Example A5) containing the cellulase gene 
according to the present invention has been deposited in the National Institute of Bioscience and Human-Technology, 
55 Ministry of International Trade and Industry (1-1-3 Higashi, Tsukuba-shi. Ibaraki-ken, Japan) under accession No! 
FERM BP-5976 (original deposit: FERM P-15732, date of original deposit: July 12, 1996). 
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Cellulase and its Gene 

[001 3] The cellulose enzyme according to the present invention is a protein having a portion or a sequence from posi- 
tion 1 to position 284 of the sequence shown in SEQ ID No. 1 . In the present invention, a portion of the sequence shown 
5 in SEQ ID No. 1 refers to a partial sequence, for example, that has a length available for a probe, or that still maintains 
cellulase activity, particularly endoglucanase activity. 

[0014] In addition, the present invention also includes a protein having a portion or all of the amino acid sequence 
from position -22 to position -1 of SEQ ID No. 1 on the N-terminal of the above protein. The amino acid sequence from 
position -22 to position -1 of SEQ ID No. 1 is considered to be a signal peptide. Therefore, in addition to the partial 

10 sequence that retains signal peptide activity, a portion of the sequence includes the sequence that remains on the N- 
terminal as a result of the processing in which the processing position may vary depending on the type of a host. 
[001 5] Moreover, modified proteins of the above protein are also encompassed by the scope of the present invention. 
In the present invention, a modified protein refers to a protein in which a modification, such as the addition, insertion, 
deletion or substitution of one or a plurality of amino acids, has occurred in the amino acid sequence of the protein, 

is while still retaining cellulase activity, particularly endoglucanase activity. 

[001 6] The sequence from position 1 to position 284 of the amino acid sequence shown in SEQ ID No. 1 is hereinafter 
referred to as cellulase NGE4, and its gene is referred to as the cellulase NCE4 gene. 

[001 7] The cellulose according to the present invention has high activity, and thus allows for the desired effects in var- 
ious uses even in small amounts. For example, according to a preferred embodiment of the present invention, in com- 

20 parison with crude cellulase prepared from a culture liquid of Humicola insolens, NCE4 shows equivalent nap removal 
effects on cellulose fibers at a protein mass of about 1 /1 00 that of the crude cellulose, equivalent denim-dyed cellulose- 
containing fiber decoloring effects at a protein mass about 1/25, and equivalent cellulose fiber reduction effects at a pro- 
tein mass of about 1/5. Accordingly, it can be used for more efficient and economical treatment of cellulose fibers. 
[001 8] According to another embodiment of the present invention, there is provided a DNA sequence that codes for 

25 the amino acid sequence of the above protein. A typical sequence of the DNA sequence has a portion or ail of the base 
sequence shown in SEQ ID No. 2. 

[001 9] The base sequence shown in SEQ ID No. 2 has an open reading frame starting at ATG at positions 1 1 8 to 1 20 
and terminating at TAA at positions 1 089 to 1 09 1 . In addition, the base sequence from position 1 71 to position 1 73 cor- 
responds to the mature protein composed of 284 residues. Moreover, the presence of an intron has been confirmed in 

30 the base sequence of SEQ ID No. 2 (see Example A7). 

[0020] When the amino acid sequence of a protein is given, the DNA sequence coding for that sequence is easily 
defined, and various base sequences that code for a portion or all of the amino acid sequence shown in SEQ ID No. 1 
can be selected. Thus, a DNA sequence coding for a portion or all of the amino acid sequence shown in SEQ ID No. 1 
according to the present invention refers to a sequence having any degenerate codons coding for the same amino acids 

35 in a base sequence, in addition to a portion or all of the base sequence shown in SEQ ID No. 2. 

[0021 ] The DNA according to the present invention may be of natural origin or be completely synthesized. In addition, 
it may also be synthesized using a portion of DNA of natural origin. Typical methods for obtaining DNA include a method 
of obtaining the DNA from a Humicola insolens chromosome library or cDNA library routinely employed in the field of 
genetic engineering, an example of which includes screening using a suitable DNA probe prepared based on informa- 

40 tion about a partial amino acid sequence. In addition, the DNA can also be obtained from the deposited microorganism 
described above. 

Expression Vector and Transformed Microorganism 

45 [0022] According to the present invention, there is provided an expression vector that contains the DNA sequence 
according to the present invention in a state that allows the DNA sequence to replicate within a host microorganism and 
to express a protein encoded by the DNA sequence. Moreover, according to the present invention, there is provided a 
microorganism that is transformed by the expression vector. There are no particular limitations on this host-vector sys- 
tem, and for example, systems using E. coli, Actinomycetes, yeast or molds, as well as fused protein expression sys- 

50 terns with other proteins using these microorganisms, can be used. Methods routinely employed in the field of genetic 
engineering can be used for the procedure and method for constructing the vector according to the present invention. 
[0023] The expression vector of the present invention may contain, in addition to the DNA sequence of the present 
invention, a DNA sequence that controls the expression of a desired protein or a selectable marker to ensure the intro- 
duction of the vector into a host and the expression of the desired protein. Since the base sequence of SEQ ID No. 2 is 

55 considered to contain there regulatory sequence, the use of this base sequence may be advantageous. In addition, the 
expression vector of the present invention may contain DNA sequences that code for cellulase in a repetitive form (tan- 
dem). The expression vector of the present invention that contains the DNA sequence of the present invention may be 
constructed in accordance with routine methods. In addition, transformation of a microorganism by the expression vec- 
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tor of the present invention may also be carried out in accordance with methods routinely employed in the art. 
[0024] The transfer mant of the present invention can be cultured in a suitable medium. The protein according to the 
present invention can be isolated from this culture. Thus, according to another embodiment of the present invention, 
there is provided a method for preparing the novel protein of the present invention. The culturing of the transformants 
5 and its conditions may be basically similar to those of the microorganism used. In addition, recovery and purification of 
the novel protein according to the present invention from the culture can be performed in accordance with routine meth- 
ods. 

Cellulase Applications/Ceilulase Preparation 

10 

[0025] The cellulase according to the present invention is used in various applications in its original form or as a cel- 
lulase preparation. In particular, it is used to give desired properties to cellulose fibers. More specifically, it is used for 
removing nap of cellulose-containing fibers, performing weight reduction processing, and performing decoloring 
processing of denim-dyed, cellulose-containing fibers. 
15 [0026] A cellulase preparation may be prepared by mixing the cellulase according to the present invention with ingre- 
dients generally contained in cellulase preparations, examples of which include excipients (e.g., lactose, sodium chlo- 
ride, sorbitol), surfactants and antiseptics. 

[0027] A process for removing nap of cellulose-containing f foers, a process for reducing cellulose-containing fibers, 
and a process for decoloring denim-dyed, cellulose-containing fibers using the cellulase or cellulase preparation 
20 according to the present invention can be carried out by contacting the cellulase of the present invention with cellulose- 
containing fibers. 

[0028] Although contact temperature, the amount of cellulase and other conditions may be decided depending on var- 
ious conditions in a system, for example, in nap removal of cellulose-containing fibers, treatment can be performed at 
about 50-60°C using cellulase at a protein concentration of 5-50 mg/liter. In the process for reducing cellulose-contain- 
25 ing fibers, treatment can be performed at about 50-60°C using cellulase at a protein concentration of 100-300 mg/liter. 
Moreover, in the process for decoloring denim-dyed, cellulose-containing fibers, treatment can be performed at about 
50-60°C using cellulase at a protein concentration of 2-10 mg/liter. 

[Examples] 

30 

[0029] The present invention will be further explained in view of the following examples that are not meant to limit the 
invention. 

Example A1 : Isolation and Purification of an Ingredient Having the Tencel Nap Removing Activity from Humicola Inso- 
35 lens 

[0030] Humicola insolens MN200-1 was cultured at 37°C in (N) medium (5.0% Avicel. 2.0% yeast extract, 0.1% 
polypeptone, 0.03% calcium chloride, 0.03% magnesium sulfate, pH 6.8). After culturing for 7 days, by centrifuging the 
resulting culture liquid at 7000 rpm for 20 minutes, the microorganisms were removed and the culture supernatant was 

40 used as the crude cellulase preparation. 

[0031] This crude cellulase preparation was applied to hydrophobic chromatography (Phenyl-Sepharose High-Per- 
formance 16/100, Pharmacia-Biotech), and fractionated by eluting with aqueous ammonium sulfate over a concentra- 
tion gradient of 1 to 0 M in 50 mM phosphate buffer (pH 7.0). Since potent Tencel nap removing activity was observed 
in the fraction obtained during a concentration gradient of 0.1 to 0 M, that fraction was again applied to hydrophobic 

45 chromatography (Phenyl-Sepharose High-Performance 16/100) and eluted with aqueous ammonium sulfate over a 
concentration gradient of 0.4 to 0 M in 50 mM phosphate buffer (pH 7.0) followed by removal of the active fraction. 
[0032] Then, this fraction was applied to reversed-phase chromatography (Source 15 ISO, Pharmacia-Biotech) and 
fractionated by eluting with aqueous ammonium sulfate over a concentration gradient of 1 to 0 M in 50 mM phosphate 
buffer (pH 7.0). Since the fraction obtained during a concentration of 0 M was observed to exhibit potent Tencel nap 

so removing activity, that fraction was further applied to reversed-phase chromatography (Source 1 5 PHE, Pharmacia-Bio- 
tech), eluted with aqueous ammonium sulfate over a concentration gradient of 1 to 0 M in 50 mM phosphate buffer (pH 
7.0), and the fraction exhibiting potent Tencel nap removing activity was isolated as purified enzyme NCE4. This NCE4 
exhibited a single band at a molecular weight of 43 kDa in SDS-PAGE. 

55 
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Example A2: Partial Amino Acid Sequence of Cellulase NCE4 

(1) Identification of N-terminal amino acid residues 

5 [0033] Column chromatography (column: RESOURCE (Trade name) RPC 3 ml. 5-60% acetonitrile gradient contain- 
ing 0.1% TFA) was performed with an FPLC system (Pharmacia-Biotech) followed by removal of the main peak to 
determine the N-terminal amino acid sequence of the protein purified in Example 1 . 

[0034] After lypophilizing the above fraction, it was dissolved in a small, amount of water followed by electrophoresis 
using 8% Gel SDS-PAGE Mini (TEFCO). After electrically transferring the protein to PVDF membrane (Millipore) using 
10 a Multiphore II electrophoresis system (Pharmacia-Biotech), and staining with Coomassie brilliant blue R-250 (Nakalai 
tesque) the membrane was decolored, washed with water and air dried. The portion at which the protein having a 
molecular weight of 43 kDa was blotted was cut out and applied to the Model 492 Protein Sequencer (Perkin-Elmer) to 
determine 1 5 residues of the N-terminal amino acid sequence. The resulting sequence was as shown below. 

15 N-terminal amino acid sequence: Ala-Asp-Gly-Lys-Ser-Thr-Arg-Tyr-Trp-Asp-(Cys)-(Cys)-Lys-Pro-Ser (15 residues) 

(2) Peptide Mapping 

[0035] After lyophilizing the protein purified by FPLC in (1 ) above, it was dissolved in 1 00 mM ammonium bicarbonate 
20 buffer (pH 8.0). Approximately 1/20 mole volume of trypsin (Promega) relative to the amount of protein was added and 
allowed to react for 48 hours at 37°C. Column chromatography (column: C8 220 x 2.1 mm, 0.1% TFA, 0% acetonitrile 
to 0.085% TFA, 35% acetonitrile gradient) was performed on this dissolution product with the Model 172u Preparative 
HPLC System (PerWn-Elmer) to obtain three types of peptides. The amino acid sequences of the resulting peptide frag- 
ments were determined using the above protein sequencer. Those results were as shown below. 

25 
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35 

[0036] Since the N-terminal amino acid sequence and amino acid sequences obtained by peptide mapping exhibit 
homology with the amino acid sequence of 43 kDa endoglucanase obtained from Humicola insolens DSM1800 shown 
in WO 91/17243 (Japanese Patent Laid-Open Publication No. 5-509223). this protein was strongly suggested to be a 
type of cellulase. 

40 [0037] In addition, when the above sequences were compared with the sequences registered in Protein Identification 
Resource (PIR), R44.0, March 1995 or those registered in SWISS-PROT R31.0. March 1995, although the above 
sequences exhibited homology with some registered sequences, they were not identical, thus clearly indicating the pro- 
tein obtained herein to be a novel protein. 

45 Example A3: Construction of Genome DNA Library 

[0038] Isolation of genome DNA was performed in accordance with the method of Horiuchi, et at. (Hiroyuki Horiuchi 
et al., J. Bacteriol., 170:272-278, 1988). 

[0039] To begin with, Humicola insolens MN200-1 was cultured in the above (N) medium at 37°C. After culturing for 
so 2 days, the microorganisms were collected by centrifugation (3500 rpm, 10 minutes). The resulting microorganisms 
were treated with phenol, proteinase K and ribonuclease A followed by polyethylene glycol (PEG) precipitation to obtain 
genome DNA. 

[0040] Then, Humicola insolens genuine DNA was digested by Sau3A I, and after confirming partial digestion within 
a range of 9-23 kbp by agarose gel electrophoresis, the DNA fragments were collected by ethanol precipitation. The 
55 DNA fragments were then ligated to a phage vector and the BamH I arm of an EMBL3 cloning kit (Stratagene) using T4 
ligase (Toyobo Co., Ltd.). After subjecting this to ethanol precipitation, the precipitate was dissolved in TE (10 mM Tris- 
HCI (pH 8.0). 1 mM EDTA) buffer. 

[0041 ] The entire amount of ligated mixture was packaged in a lambda head using package components frozen as 
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described in the method of Hohn, B. (Hohn, B., Methods Enzymol., 68:299-309, 1979) and the Giga-Pack II Packaging 
Kit (Stratagene), after which the resulting phage was infected into E, cgji strain LE392. The target gene was then cloned 
using the 5 x 1 04 phage library resulting from this method. 

Example A4: Preparation of Long Chain Probe by PCR 

[0042] A long probe amplified by PC R was prepared using the total DN A of Humicola insolens as a template, and this 
was used as the DNA probe. 

[0043] DNA corresponding to the amino acids indicated with an asterisk on the N-terminal and peptide TP-3 were 
synthesized for each of the primers. The sequences of the prepared synthetic oligonucleotides were as shown below. 

NCE4N1 :5 , -GCXGA(CT)GGXM(AG)TC(AGC"0AC-3' (17mer) 
NCE4N2:5'GCXGA(CT)GGXAA(AG)AG(CT)AC-3' (1 7mer) 
NCE4C: 5-'CXGC(AG)TT(CT)TT(AG)AACCA(AG)TC-3' (19mer) 
(X: inosine) 

[0044] PCR reactions were conducted under the following conditions. To begin with, 2 pairs of tubes of 1 uM each of 
primers NCE4N1 and NCE4C added to 1 m9 of Humicola insolens genome DNA, and 1 uM each of primers NCE4N2 
and NCE4C added to 1 ug of Humicola insolens genome DNA were prepared, and then subjected to thermal denatur- 
ation for 5 minutes at 95°C in the presence of dNTP. Later, Taq polymerase (recombinant Taq, Takara Shuzo) was 
added to amplify by repeating 25 cycles under reaction conditions of 94°C for 1 minute, 45°C for 2 minutes and 72°C 
for 3 minutes. As a result, approximately 750 bp DNA was amplified only in the case of using primers NCE4N1 and 
NCE4C. This was used as the probe in following screening. 

Example A5: Cloning of Cellulase Component NCE4 Gene 

(1) Screening by Plaque Hybridization 

[0045] 1 00 ng of the approximately 750 bp DNA fragment amplified by PCR was labeled in advance by an ECL Direct 
DNA/RNA Labeling Detection System (Amersham). 

[0046] The phage plaque prepared according to the method described in Example 2 was transferred to High-Bond N 
+ Nylon transfer membrane (Amersham), and after denaturing with 0.4 N sodium hydroxide, it was washed with 5-fold 
concentrated SSC (15 mM trisodium citrate, 150 mM sodium chloride) and dried to fix the DNA. After pre-hybridizing 
(42°C) for 1 hour according to the method of the kit, the previously labeled probe was added after which hybridization 
was performed for 4 hours (42°C). Washing of the label was performed in accordance with method of the above kit. To 
begin with, washing was repeated twice for 20 minutes at 42°C using 0.5-fold concentrated SSC containing 0.4% SDS 
and 6 M urea, followed by washing twice for 5 minutes at room temperature using 2-fold concentrated SSC. 
[0047] After immersing the Nylon membrane from which the probe had been washed in the detection solution pro- 
vided for 1 minute, it was photosensitized to Hyper-Film ECL (Amersham) to obtain four positive clones. 

(2) Preparation of Phage DNA 

[0048] E. coil LE392 was infected with a phage, the phage particles were collected 8 hours later, end after treating 
with proteinase K and phenol according to the method of Grossberger (Grossberger, D., Nucleic Acids Res. 
1 5,6737, 1 987), the phage DNA was separated by ethanol precipitation. 

(3) Subcloning of Target Gene 

[0049] Four types of phage DNA were digested by Sal I and applied to agarose gel electrophoresis. 
[0050] DNA was transferred to a Nylon membrane by the method of Southern (Southern, E.M., J. Mol. Biol., 98: 503- 
517, 1975), and hybridized using a probe of approximately 750 bp under the same conditions as the plaque hybridiza- 
tion of (1) above followed by detection of the 5.2 kbp DNA fragment containing the target gene. As a result, the four 
types of phage DNA has Sal I fragments of the same size. 

[0051 ] This 5.2 kbp DNA fragment was separated using the Sephaglass Band Prep Kit (Pharmacia-Biotech) followed 
by subcloning to the Sal I site of plasmid pUC119 using E. coil JN109. The resulting plasmid was designated as 
pNCE4Sal. 
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Example A6: Determination of Base Sequence 
(1) Base Sequence Analysis of Genome DNA 

5 [0052] Determination of base sequence was performed in the manner described below. 

[0053] The A.L.F. DNA Sequencer II (Pharmacia-Biotech) was used for the base sequence analyzer. Acrylamide car- 
rier able to be acquired as Ready-Mix Gel (Pharmacia-Biotech) or Hydrolink Long Ranger (FMC) was used for the 
sequencing gel. A. LP grade reagents (Pharmacia-Biotech) were used for the various reagents used for gel preparation 
(N.N.N'.N'-tetramethylethylenediamine, urea, ammonium persulphate). The Auto-Read Sequencing Kit (Pharmacia- 

w Biotech) was used for the base sequence decoding reaction. Gel preparation conditions, reaction conditions and migra- 
tion conditions were set by referring to the details of each instruction manual. 

[0054] After alkaline-denaturing pNCE4Sal with 10 ug of 2 M sodium hydroxide to use it as a template DNA, it was 
annealed with the universal primer provided with the Auto-Read Sequencing Kit to perform an elongation reaction. After 
sequencing the reaction product with the sequencer, a 546 bp base sequence was sequenced. Based on this result, an 
is FITC-labeled sequencing primer in the form of MNEG01 was prepared, reacted with pNCE4Sal followed by further 
sequencing. The following primer was then prepared based on the result obtained after which the entire length of NCE4 
was decoded as a result of proceeding with decoding. The prepared FITC-labeled sequencing primers were as indi- 
cated below. 

20 MNEG-01 : 5'-GTGATGAGGGCTGGCGACAGGCC-3' (23mer) 
MNEG-02: 5*-CTGCCACCTCTATTGCCGGCAGC-3' (23mer) 
MNEG-03: 5'-CCCGACGCCCTCAAGCCCGGCTG-3' (23mer) 
MNEG-04: 5'-GGCTGGAGCGGCTGCACCACCTG-3' (23mer) 

25 (2) Base Sequence Determination 

[0055] FITC-labeled sequencing primer DNA in the form of MNEG-05 through MNEG-08 were synthesized based on 
the results of (1) above. The prepared FITC-labeled sequencing primers were as indicated below. 

30 MNEG-05; 5'-GACCTGACGGAAGCTGAAGCTCG-3' (23mer) 
MNEG-06: 5'-AGCAGTGCAGCCGCTGGGAGTCG-3' (23mer) 
MNEG-07; 5'-TGGCAGATGAGGACGTGGTGTTG-3' (23mer) 
MNEG-08: 5' CGCAGCCGGACTTGGCGTCGAAG-3' (23mer) 

35 [0056] These primers were reacted with pNCE4Sal using the Auto-Read Sequencing Kit. First of all, 1 0 ug of plasmid 
were alkaline-denatured, annealed with each primer, and allowed to react with T7 polymerase. As a result, a 1257 bp 
base sequence was able to be determined in the Sa! I fragment. This sequence was as shown in SEQ ID No. 3. 

Example A7: Determination of Intron 

40 

[0057] Determination of intron was performed by preparing mRNA from Humicola insolens MN200-1, synthesizing 
cDNA using reverse transcriptase, and comparing this with the genome base sequence to identify the same region. 

(1) Isolation of Whole RNA 

45 

[0058] Humicola insolens MN200-1 was cultured for 2 days in cellulase-inducing medium, and preferable the previ- 
ously mentioned (N) medium, followed by collection of the microorganisms by centrifugation (3500 rpm, 10 minutes). 2 
g of the above microorganisms were washed with sterile water and crushed with a blender (Nihon SeiW, Homogenizer 
AM-3) while frozen with liquid nitrogen. The crushed microorganisms were then suspended in 1 0 ml of denaturing solu- 

50 tion containing 4 M guanidine thiocyanate (4 M guanidine thiocyanate, 25 mM trisodium citrate, 0.5% sodium N-lauryl- 
sarcosinate, 0.1 Mmercaptoethanol). After stirring for several minutes at room temperature, the suspension was 
neutralized with 1 ml of 2 M sodium acetate (pH 4.5) and additionally stirred after adding 10 ml of TE saturated phenol. 
2 ml of chloroform-isoamyl alcohol (24:1) were added and after stirring well, the microorganism components denatured 
with phenol were removed by centrifugation (3500 rpm, 10 minutes). The upper layer (aqueous layer) was aspirated off 

55 and nucleic acids were precipitated with 1 0 ml of isopropanol. The resulted precipitate was then centrifuged (3500 rpm, 
10 minutes) to recover the nucleic acids after which the precipitate was washed with 70% ethanol-water by additional 
centrifugation. 

[0059] After dissolving this precipitate in 3.5 ml of TE, 880 ^l of 1 0 M lithium chloride solution were added to the solu- 
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tion followed by refrigerating for 2 hours at 5°C and centrifugation (12,000 rpm, 10 minutes) to recover the precipitate. 
This precipitate was washed with 70% ethanol and used as the whole RNA fraction. The recovered amount was 2. 7 mg 
and the yield was 0.14%. 

s (2) Preparation of Poly A Tail+ RNA (= mRNA) 

[0060] Preparation of mRNA was performed using a mRNA purification kit (Pharmacia-Biotech). 
[0061 ] To begin with, 1 mg of the whole RNA prepared in (1) above was dissolved in 1 ml of elution buffer followed by 
thermal denaturation treatment for 1 0 minutes at 65°C. After rapidly cooling in ice, 0.2 ml of sample buffer were added. 
10 The entire amount of this RNA solution was charged into an oligo(dT) cellulose column, and after washing the column 
three times with high-salt buffer and 3 times with low-salt buffer, eluted with elution buffer boated to 65°C. This column 
procedure was repeated twice and the resulting product was used as the mRNA fraction. The amount recovered was 
19.2 ng the yield was 2%. 

is (3) Synthesis of cDNA 

[0062] Synthesis of cDNA was performed using a time-saver cDNA synthesis kit (Pharmacia-Biotech). 
[0063] 5 ng of mRNA were dissolved in 20 ug of sample buffer. After thermal denaturing for 10 minutes at 65°C, dithi- 
othreitol. solution and o!igo(dT) primer were both added to first strand synthesis mix and allowed to react for 1 hour at 
20 37°C. Then, this entire amount was added to second strand mix and allowed to react for 30 minutes at 12°C and then 
for 1 hour at 22°C, and the resulting product was used as cDNA. 

(4) PCR Amplification of Cellulase NCE4 cDNA 

25 [0064] Using 1 ug of the synthesized cDNA as a template, only the target cDNA was amplified by PCR. Oligonucle- 
otide primers having the sequences indicated below were prepared for use as primers of the N- and C-terminals. 

NCE4-CN: 5'-ATGCGTTCCTCCCCTCTCCTCCGCTCCGCC-3' (30mer) 
NCE4-CC: 5'-TACAGGCACTGATGGTACCAGTCATTAATC-3' (30mer) 

30 

[0065] The PCR reaction was carried out under the following conditions. 1 uM each of primer was added to 1 ug of 
Humicola insolens cDNA followed by thermal denaturation for 10 minutes at 94°C in the presence of dNTR Then, Taq 
polymerase (recombinant Tag, Takara Shuzo) was added to amplify by repeating 30 cycles under reaction conditions 
of 94°C for 1 minute, 50°C for 2 minutes and 72°C for 3 minutes. As a result of performing agarose gel electrophoresis, 
35 the amplified fragment was determined to be 0.9 kbp in size. This was then concentrated by ethanol precipitation and 
cloned using a pT7 blue-T vector kit (Novagen). This plasmid was designated as pCNCE4. 

(5) Analysis of cDNA Base Sequence 

40 [0066] The sequencing reaction was performed using an Auto-Read Sequencing Kit in the same manner as previ- 
ously described. 

[0067] Plasmid pCNCE4 was alkaline-denatured with 2M sodium hydroxide and precipitated with ethanol. This single- 
strand plasmid was used as a template and allowed to react with T7 polymerase. The sequence was determined by per- 
forming reactions using the above synthetic primers MNEG01, MNEG02, MNEG03, MNEG04, MNEG05, MNEG06, 
45 MNEG07 and MNEG08 as well as the universal primer and reverse primer provided with the kit. 

[0068] As a result, a single, 56 bp intron was present. The non-translated initiation sequence, its stop sequence, and 
the regulatory sequence within the intron in the sequence of SEQ ID No. 3 are as shown below (the numbers indicate 
the sequence position numbers of SEQ ID No. 3). 

so Intron: 453-458, 506-508, 491 -497 

Example A8: Evaluation of the Nap Removing Activity of NCE4 on Cellulose-Containing Fibers 

[0069] Pre-dyca Tencel fabric (Courtauls) was napped using a large washer adding surfactant and a rubber ball. Ten- 
55 eel in which nap were formed as a result of this treatment was removed of nap by cellulase treatment under the following 
conditions followed by calculation of the protein concentration of cellulase required to completely remove the nap. 

Test machine: 20 kg washer (Sanyo, Fully- Automated Washing Machine SCW5101) 
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Bath ratio: 1:20 

Heating: 55°C 

Time: 60 minutes 

pH: 7 (10 mM phosphate buffer) 

[0070] In addition to cellulase enzyme liquid, rubber balls were added in an amount roughly twice the weight of the 
cloth to the treatment liquid. 



Table 1 







Amt. of Protein Required 






to Remove Nap from 






Tencel* 


75 


NCE4 


5 mg/liter 




Crude Cellulase Preparation 


500 mg/liter 



•: The amount of protein was quantified by using a protein 
assay kit (Biorad) and bovine serum albumin for the stand- 



[0071] Based on the results, it was determined that NCE4 is able to demonstrate an equal degree of Tencel nap 
removal at a protein concentration 1/100 that of crude cellulase preparation. 

25 Example A9: Evaluation of Decoloring Activity of NCE4 on Denim-Dyed, Cellulose-Containing Fibers 

[0072] Destarched 1 2-ounce blue jeans were decolored under the conditions described below. 



Test machine: 20 kg washer (Sanyo, Fully-Automatic Washing Machine SCW5101) 
30 Bath ratio; 1:50 
Heating: 55°C 
Time: 60 minutes 
pH; 7 (10 mM phosphate buffer) 



35 [0073] In addition to cellulase enzyme liquid, rubber balls were added in an amount roughly twice the weight of the 
cloth to the treatment liquid. 

[0074] Decoloring was determined by measuring the L value (lightness) of an Lab indication system using a color dif- 
ference type of Color Analyzer Topscan Model TC-1800MK2 (Tokyo Denshoku). Representing the increase in L value 
relative to a control (increase in lightness) as the AL value, the AL values of 5 points for each test area evaluated for 
40 decoloring were measured (n = 5), and the mean value of 3 points after discarding the maximum and minimum values 
was used. The protein concentration of cellulase required to decolor to a level of AL = 7 was calculated. 



Table 2 





Amt. of Protein Required 
to Discolor Blue Jeans 


NCE4 

Crude cellulase preparation 


1 .8 mg/liter 
45.0 mg/liter 



[0075] Based on these results, NCE4 was determined to be able to demonstrate an equal degree of discoloring of 
denim-dyed blue jeans at a protein concentration 1/25 that of crude cellulase preparation. 

55 Example A10: Evaluation of Weight Reduction Activity of NCE4 on Cellulose-Containing Fibers 

[0076] Enzyme treatment was performed on Cupra, Recycled cellulose fibers (Asahi Chemical Industry, 15 cm long 
x 10 cm wide) for which absolute dry weight was measured in advance under the conditions indicated below. 
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Test machine: Washing resistance tester L-12 (Daiei Scientific Instruments) 
Bath ratio: 1:50 
Heating: 55°C 
Time: 60 minutes 
5 pH: 7 (40 mM phosphate buffer) 

[0077] In addition to cellulase enzyme liquid, stainless steel balls provided (Daiei Scientific Instruments) were added 
to the treatment liquid. Following enzyme treatment, the fibers were dried, the absolute dry weight of the Cupra fibers 
were measured, and the weight reduction ratio relative to before enzyme treatment was measured. The protein concen- 
10 tration of cellulase required to produce a weight reduction ratio of 8% was calculated. 



Table 3 





Ami of Protein Required 
for Cupra Weight Reduc- 
tion of 8% 


NCE4 

Crude cellulase preparation 


100 mg/liter 
500 mg/liter 



[0078] Based on these results, NCE4 was determined to be able to demonstrate an equal degree of weight reduction 
processing on Cupra fibers at a protein concentration 1/5 that of crude cellulase preparation. 

25 Example B1 : Construction of Plasmid pMKD01 

(1) Construction of Plasmid pUC1 18BN 

[0079] After cleaving 1 ug of pUC1 1 8 DNA with BamH I, the restriction enzyme was inactivated by treatment with phe- 
30 nol. This was then subjected to ethanol precipitation and dissolved in a small amount of TE (10 mM Tris-HCI (pH 8.0), 
1 mM EDTA) buffer. The ends of this DNA were blunted with a DNA branching kit (Takara Shuzo) . This was further 
ligated with a DNA ligation kit (Takara Shuzo) and allowed to self-close. The resulting ligated mixture was transformed 
into E. coli component celts JM109 (Takara Shuzo). Those transformants able to grow on LB agar medium (1% 
polypeptone, 0.5% yeast extract, 1% NaCI. 1.5% agar) containing 100 ^g/ml of ampicillin, 1 mM IPTG and 0.004% of 
35 X-ga! that also demonstrated white colonies were selected, and additionally cultured overnight at 37°C in LB medium 
(1% polypeptone, 0.5% yeast extract, 1% NaCI) containing 100 ng/ml of ampicillin. Plasmid DNA was recovered from 
the resulting culture liquid using alkaline-SDS. This plasmid DNA was cleaved with BamH I and applied to 0.8% agar- 
ose gel electrophoresis followed by selection of plasmid DNA in which the BamH I site of pUC1 18 DNA was destroyed. 
This plasmid DNA was designated as pUC1 1 8BN. 

40 

(2) Construction of Plasmid pUC1 18BSN 

[0080] 1 \ig of pUC1 18BN DNA was digested with Sph I and plasmid DNA was obtained in which the Sph I site of 
pUC1 18BN was destroyed according to the same method as described above. This plasmid DNA was designated as 
45 pUC118BSN. 

(3) Construction of Plasmid pM21 

A) Isolation of Cellulase NCE2 Gene 

50 

[0081] The cellulase NCE2 gene obtained from Humicola insolens according to the method described in Japanese 
Patent Laid-Open Publication No. 8-126492, and a Pst l-Xba I fragment of a total length 3.4 Kbp having a 1.4 Kb DNA 
sequence upstream and a 0.5 Kb DNA sequence downstream as its promoter and terminator regions, were ligated to 
the Pst l-Xba I site of the previous pUC1 18BSN. The resulting plasmid DNA was designated as pUCl 1 8BSN-PX. 

55 

B) Site-directed mutagenesis reaction of Plasmid pUC118BSN-PX 

[0082] BamH I sites were introduced by site-specific mutation as described below downstream of the N-terminal and 
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immediately downstream from the stop codon of NCE2 gene. E. coli JM109 strain was transformed by plasmid 
pUC1 18BSN-PX, and after additionally infecting with helper phage M13K07, culturing was performed for 16-20 hours 
at 37°C in 30 ml of 2xYT liquid medium (1 .6% bactotryptone, 0.8% yeast extract, 0.5% NaCI) containing ampicillin and 
kanamycin at concentrations of 150 ng/ml and 70 jig/ml, respectively. Single-strand DNA of M13 (ssDNA) was recov- 
5 ered from the culture supernatant. Site-directed mutagenesis reaction was performed using this ssDNA and two types 
of synthetic oligonucleotides with the Sculpture In Vitro Mutagenesis System (Amersham). The sequences of the pre- 
pared synthetic oligonucleotide primers are as shown below. 

MNC-02 5 , -GAGCGCCAGAACTGTGGATCCACTTGGTGAGCAATG-3 , (36mer) 
10 MNC-03 5' TCCGCCGTTCTGAGCGGATCCAGGCGTTTGGCGCG-3' (35mer) 

[0083] The mixture of site-directed mutated DNA was introduced into E. coli TG1 and the resulting transfbrmant was 
cultured in LB medium (1% polypeptone, 0.5% yeast extract, 1% NaCI) containing 100 ug/ml of ampicillin, followed by 
recovery of the plasmid DNA. This plasmid DNA was digested with BamH I and applied to 0.8% agarose gel electo- 
rs phoresis followed by selection of plasmid DNA in which BamH I sites were introduced at two locations in plasmid 
pUC1 18BSN-PX. This plasmid DNA was designated as pM21 . 

(4) Isolation of The Cellulase NCE3 Gene 

20 [0084] The cellobiohydrolase gene (NCE3) originating in Humicola insolens was isolated by PCR based on the 
sequence of known cellobiohydrolase gene originating in Humicola grisea (de Oliviera Alzevedo. M. et al., J. General 
Microbiol., 136: 2569-2576. 1990). 

A) Isolation of Genome DNA 

25 

[0085] Genome DNA of Humicola insolens MN 200-1 was obtained by the method of previously described Example 
A3. 

B) Amplification of The Cellulase NCE3 Gene by PCR 

30 

[0086] The NCE3 gene of Humicola insolens was isolated by PCR based on the sequence of cellobiohydrolase gene 
originating in Humicola grisea. Each primer was designed in advance in a form that contained a BamH I site so that the 
PCR product containing NCE3 is able to be ligated in line with the frame to the BamH I site of plasmid pM21 . Synthetic 
oligonucleotides having the sequences shown below were prepared for use as primers. 

35 

MKA-05: 5'-GCCGCCCAGCAGGCGGGATCCCTCACCACCGAGAGG-3' (36mer) 
MKA-06: 5'-TGATCGTCGAGTCAGGGATCCAGAATTTACAGGCAC-3' (36mer) 

[0087] The PCR reaction was carried out according to the following method in accordance with the LA PCR Kit Ver. 

40 2 (Takara Shuzo). To begin with, 1 nM of each primer, 400 nM dNTP and 2.5 U of LA Taq polymerase were added to 1 
ng of the Humicola insolens genome DNA obtained by the method described above followed by amplification by repeat- 
ing 30 cycles under reaction conditions of 94°C for 1 minute, 55°C for 2 minutes and 72°C for 3 minutes. As a result of 
0.8% agarose gel electrophoresis, amplification of 1 .6 Kbp DNA was confirmed. This 1 .6 Kbp DNA fragment was recov- 
ered using a Sephaglass Band Prep Kit (Pharmacia-Biotech), and then ligated to pT7 Wue-T vector kit (Novagen). This 

45 plasmid DNA was designated as pK21. 

(5) Construction of Plasmid pKM04 

[0088] Plasmid pK21 DNA was digested with BamH I and a 1.6 Kbp DNA segment was recovered. Then, plasmid 
so pM21 DNA was digested with BamH I after which restriction enzyme was inactivated by treating for 10 minutes at 70°C. 
This was dephosphated with calf alkaline phosphatase (Takara Shuzo) and separated by 0.8% agarose gel electro- 
phoresis followed by recovery of a 5.2 Kbp DNA fragment. The 1 .6 Kbp DNA fragment originating in pK21 and the 5.2 
Kbp DNA fragment originating in pM21 were ligated to obtain plasmid pKM04. 

55 (6) Construction of Plasmid pMKD01 

[0089] To begin with, a gene was prepared that allowed expression of the destomycin-resistant gene described in Jap- 
anese Patent Laid-Open Publication No. 59-1 75889 in Humicola insolens using the promoter and terminator of the trp 
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C gene originating in Aspergillus nidulans obtained by known methods (Mullaney, E J. et al.. Mol. Gen. Genet, 199: 37- 
45, 1985). This gene was introduced into the Xba I site of plasmid pKM04 to prepare plasmid pMKDOl . 

Example B2: Transformation of Humicola Insolens by Plasmid pMKD01 

5 

(1) Preparation of Highly Concentrated Purified Standard of Plasmid pMKDOl 

[0090] Highly concentrated purified standard of pMKDOl was first prepared in order to introduce plasmid pMKDOl 
into Humicola insolens. After introducing pMKDOl into E. coli JM109, it was cultured overnight at 37°C in 100 ml of LB 
10 medium containing 100 ug/ml of ampicillin. The resulting culture liquid was purified using a Flexiprep Kit (Pharmacia- 
Biotech) to obtain 1 ug/uJ of pMKDOl plasmid DNA. 

(2) Transformation of Humicola Insolens 

75 [0091] Humicola insolens MN200-1 was cultured at 37°C in (S) medium after which the microorganisms were col- 
lected by centrifugation for 10 minutes at 3000 rpm 24 hours later. Here, the composition of the (S) medium consisted 
of adding glucose (3.0%) but omitting Avicel from the previously mentioned (N) medium. The resulting microorganisms 
were washed with 0.5 M sucrose and suspended in 10 ml of protoplastic enzyme solution filtered with a 0.45 urn filter 
(5 mg/ml Novozyme 234 (NLI), 5 mg/ml Celtulase Onozuka R-10 (Yakult), 0.5 M sucrose). The mycelia were converted 

20 into protoplasts by shaking for 60 to 90 minutes at 30°C. After filtering this suspension, the protoplasts were recovered 
by centrifugation for 10 minutes at 2500 rpm and then washed with SUTC buffer (0.5 M sucrose, 10mM calcium chlo- 
ride. 10mM Tris-HCI (pH 7.5)). 

[0092] Protoplasts prepared in the above manner were suspended in 1 ml of SUTC buffer followed by the addition of 
10 »g of DNA (TE) solution (10 ul) to 100 ul of this suspension and allowing to stand undisturbed for 5 minutes in ice. 
25 Then. 400 ul of PEG solution (60% PEG4000, 1 0 mM calcium chloride, 1 0 mM Tris-HCI (pH 7.5)) were added, and after 
allowing to stand undisturbed for 20 minutes in ice, 10 ml of SUTC buffer were added followed by centrifugation for 10 
minutes at 2500 rpm. After suspending the collected protoplasts in 1 ml of SUTC buffer, they were again centrrf uged for 
5 minutes at 4000 rpm and finally suspended in 100 \i\ of SUTC buffer. 

[0093] Protoplasts subjected to the above treatment were layered with YMG soft agar over YMG medium containing 
30 200 ug/ml of hygromycin B (1% glucose, 0.4% yeast extract, 0.2% malt extract. 1% agar (pH 6.8)), and after culturing 
for 5 days at 37°C, the formed colonies were used as transformants. 

(3) Culturing of pMKDOl Transformed Strains and Evaluation by SDS-PAGE 

35 [0094] Plasmid pMKDOl was introduced into Humicola insolens MN200-1 as previously described followed by selec- 
tion of 50 strains exhibiting resistance to hygromycin. These were cultured for 5 days at 37°C in (N) medium. When the 
resulting culture supernatant was analyzed by SDS-PAGE, in 5 clones of pMKD0 1 transformed strains, the protein band 
estimated to be NCE3 was increased 3-4 times higher than the parent strain. 

40 (4) Identification of N-Terminal Amino Acid Residues of Recombinant NCE3 

[0095] In order to confirm that the protein band expressed in large amounts originated in the NCE3 gene based on 
the results of SDS-PAGE, the N-terminal amino acid sequence of this protein was determined. To begin with, column 
chromatography was performed on the culture supernatants obtained from the parent strain and NCE3 high-expression 

45 strains using an FPLC system in accordance with the method of the previously described Example A2, followed by com- 
parison of the major peaks . The peak that was particularly increased in the NCE3 high-expression strains was removed 
and lyophilized. This was dissolved in a small amount of water followed by electrophoresis using 8% Gel SDS-PAGE 
Mini (Difco). The protein was then electrically transferred to a PVDF membrane according to the method of Example A2 
above, and after staining with Coomassie brilliant blue R-250, the membrane was decolored and washed with water. A 

so portion in which a protein having a molecular weight of 66 KD was blotted was cut out from this membrane, and the 
modified N-terminal residues were removed from the blotted protein according to the method of Podell. D.N. et al. 
(Podell. D.N., et al., Biochem. Biophys. Res. Commun., 81 : 176, 1978). First of all, the target protein was cut out, and 
after incubating for 30 minutes at 37°C in a small amount of 0.5% polyvinylpyrrolidone (molecular weight: 40,000, 
Sigma)/100 mM acetic acid solution, was washed well with water. Then, the modified N-terminal residues were 

55 removed by Pfu pyroglutamate aminopeptidase (Takara Shuzo), washed with water and air dried. This was then applied 
to a Model 492 Protein Sequencer to determine 15 residues of the N-terminal amino acid sequence. The resulting 
sequence is shown below. 
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N-terminal amino acid sequence: Asn-Cys-Gly-Ser-Leu-Thr-Thr-Glu-Arg-His-Pro-Ser-Leu-Ser-Trp (15 residues) 

[0036] This N-terminal amino acid sequence coincided with the cellutase NCE2 and NCE3 fused protein amino acid 
sequence estimated from the base sequence of plasmid pMKDOl . 

5 

(5) FPLC Evaluation of pMKDOl Transformed Strains 

[0097] As was previously described, column chromatography was performed using an FPLC system to further quan- 
tify the culture supernatants of the 5 clones confirmed to express large amounts of NCE3 by SDS-PAGE. The condi* 
10 tions were the same as in (4) above. The NCE3 peak was removed and lyophilized, and the weight was measured and 
the productivity was compared between high expression strains and the parent strain. Those results were as shown in 
the table below. 



Table 4 





NCE3 Productivity* 


Humicola insolens MN200-1 (parent strain) 
Humicola insolens pMKDOl 


0.46 g 
1.8g 



* Productivity refers to the amount produced per one liter of culture liq- 
uid. 



Example B3: Construction of Plasmid pEGD01 

25 

[0098] Plasmid pMKDOl was digested with BamH i and restriction enzyme was inactivated by heat treatment at 70°C 
after which it was dephosphated to recover an 8.2 Kbp DNA fragment. 

[0099] Then, the NCE4 gene was amplified by PCR based on the sequence of the NCE4 gene originating in Humicola 
insolens obtained in the above Examples A1 through A7. This PCR product containing NCE4 was designed in a form 
30 in which each primer contained a BamH I site in advance so that it could be ligated while aligning the frame with the 8.2 
Kbp BamH I fragment of the above plasmid pMKDOl . Synthetic oligonucleotides having the sequences shown below 
were prepared for use as primers. 

NCE4-N: 5XCGGTGTTGGCCGGATCCGCTGATGGCAAG-3' (30mer) 
35 NCE4-C: 5*-TAAGGCCCTCAAGGATCCCTGCGTCTACAG-3' (30mer) 

[0100] The PCR reaction was carried out in the following manner. 1 uM of each primer, 400 uM dNTP and 2.5 U of 
Pfu DNA polymerase (Stratagene) were added to 1 ug of Humicola insolens genome DNA, and a 0.8 Kbp DNA frag- 
ment was amplified by repeating 25 cycles of reaction conditions consisting of 94°C for 1 minute, 55°C for 2 minutes 
ao and 72°C for 3 minutes. This 0.8 Kbp DNA fragment was recovered and ligated to the above 8.2 Kbp BamH I fragment 
of pMKDOl . This plasmid DNA was designated as pEGD01 . 

Example B4: Expression of Plasmid pEGD01 

45 (1) Transformation of Humicola Insolens by Plasmid pEGD01 

[0101] Transformation of Humicola insolens MN200-1 by plasmid pEGD01 was performed in accordance with the 
method of Example B2. To begin with, a highly concentrated and purified standard of pEGD01 was prepared to obtain 
1 ug/ul of pEGD01 plasmid DNA. Humicola insolens MN200-1 was transformed using 10 ul of this pEGD01 solution fol- 
50 lowed by selection of 50 transformant strains exhibiting resistance to hygromycin. These were cultured for 5 days at 
37°C in (N) medium. When the resulting culture supernatant was analyzed by SDS-PAGE, the protein band estimated 
to be NCE4 was increased 10-16 times more than in the parent strain in 10 clones of the strains transformed by 
pEGDOL 

55 (2) Identification of N-Terminal Amino Acid Residues of Recombinant NCE4 

[0102] In order to confirm that protein band expressed in large amounts originated in the NCE4 gene based on the 
results of SDS-PAGE, the N-terminal amino acid sequence of this protein was determined. To begin with, column chro- 
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matography was performed on the culture supernatants obtained from the parent strain and NCE4 high-expression 
strains using an FPLC system followed by comparison of the major peaks. The conditions were the same as in the 
above Example B2. Ttie peak that was particularly increased in the NCE4 high-expression strains was removed and 
lyophilized. This was dissolved in a small amount of water. After removing the modified N-terminal residues according 
5 to the method of Example B2, the N-terminal amino acid sequence was determined using the above protein sequencer. 
As a result, two types of N-terminal amino acid sequences were obtained in a ratio of about 7:3. The resulting 
sequences were as shown below. Then, the N-terminal amino acid sequence was determined using the same protein 
sequencer without removing the modified N-terminal. As a result, only amino acid sequence 1 was obtained as shown 
below. 

10 

N-terminal amino acid sequence 1: Val-Val-Glu-Glu-Arg-Gln-Asn-Cys-Gly-Ser-Ala-Asp-Gly-Lys-Ser-Thr-Arg-Tyr- 
Trp-Asp (20 residues) 

N-terminal amino acid sequence 2: Asn-(Cys)-Gly-Ser-Ala-Asp-Gly-Lys-Ser-Thr-Arg-Tyr-Trp-Asp-(Cys)-(Cys)-Lys- 
Pro-Ser-(Cys) (20 residues) 

15 

[0103] TTiese N-terminal amino acid sequences coincided with the cellulase NCE2 and NCE4 fused protein amino 
acid sequence estimated from the base sequence of plasmid pEGD01 . Since two types of N-terminal amino acid 
sequences were obtained, it was clearly shown that when the signal sequence of this fused protein is cleaved, it is proc- 
essed at a plurality of sites. 

20 

(3) FPLC Evaluation of pEGD01 Transformed Strains 

[01 04] As was previously described, column chromatography was performed using an FPLC system to further quan- 
tify the culture supernatants of the 5 clones confirmed to express large amounts of NCE4 by SDS-PAGE, and the NCE4 
25 peak was removed. This was then lyophilized. and the weight thereof was measured and the productivity was compared 
between high expression strains and the parent strain. Those results were as shown in the table below. 



Tables 





NCE4 Productivity* 


Humicola insolens MN200-1 (parent strain) 
Humicola insolens pMKEGt 


0.28 g 
4.5 g 



* Productivity refers to the amount produced per one liter of culture liq- 
uid. 



Example B5: Construction of Plasmid plED02 

40 (1) Construction of Plasmid plD01 

[0105] Plasmid pEGD01 was digested with Hind III and BamH I and a 7.2 Kbp DNA fragment was recovered. 
[0106] Then, the portion of DNA that codes for the promoter and signal sequences of NCE1 gene was amplified by 
PCR based on the sequence of NCE1 gene originating in the Humicola insolens obtained with the method described 
45 in Japanese Patent Laid-Open Publication No. 8- 56663.This PCR product that contains NCE1 promoter and signal 
sequence was designed in a form in which each primer contains an Hind III and BamH I site in advance so that it can 
be ligated to the 7.2 Kbp Hind III to BamH I fragment of the above plasmid pEGD01 . Synthetic oligonucleotides having 
the sequences shown below were prepared for use as primers. 

so PNCE1 -N: 5'-GTCATGAAGCTTCATTAAGGTACGTATGCAAC-3' (32mer) 

PNCE1-C: 5-GGTGATGGATCCGGCCTGCTGGGCAGCGACGC-3' (32mer) 

[0107] The PCR reaction was carried out in the same method as Example B3. 1 uM of each primer, 400 uM dNTP and 
2.5 U of Pfu DNA polymerase were added to 1 ug of Humicola insolens genome DNA, and a 1.5 Kbp DNA fragment 
55 was amplified by repeating 23 cycles of reaction conditions consisting of 94°C for 1 minute, 55°C for 2 minutes and 
72°C for 4 minutes. This 1 .5 Kbp DNA fragment was recovered by digesting the PCR product with Hind III and BamH I. 
It was then ligated to the above 7.2 Kbp Hind III to BamH I fragment of pEGD01 . This plasmid DNA was designated as 
plD01. 
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(2) Construction of Plasmid plED02 

[0108] Plasmld plD01 was digested with BamH I and restriction enzyme was inactivated by heat treatment at 70°C 
after which it was dephosphated to recover an 8.6 Kbp DNA fragment. Then, after digesting plasmid pEGD01 with 
BamH I , a 0.8 Kbp DNA fragment was recovered that contained NCE4 gene. The two fragments were ligated to obtain 
plasmid plED02. 

Example B6: Expression of Plasmid plED02 

(1) Transformation of Humicola Insolens by Plasmid plED02 

[0109] Transformation of Humicola insolens MN200-1 by plasmid p!ED02 was performed in accordance with the 
method of Example B2. To begin with, a highly concentrated and purified standard of plED02 was prepared to obtain 1 
ug/ul of plED02 plasmid DNA. Humicola insolens MN200-1 was transformed using 10 uJ of this plED02 solution fol- 
lowed by selection of 50 transformarrt strains exhibiting resistance to hygromycin. These were cultured for 5 days at 
37°C in (N) medium. When the resulting culture supernatant was analyzed by SDS-PAGE, the protein band estimated 
to be NCE4 was increased 5-10 times more than in the parent strain in 5 clones of the strains transformed by plED02. 

(2) Identification of N-Terminal Amino Acid Residues of Recombinant NCE4 

[01 1 0] In order to confirm that protein band expressed in large amounts originated in NCE4 gene based on the results 
of SDS-PAGE, the N-terminal amino acid sequence of this protein was determined. To begin with, according to the 
method of Example B2 column chromatography was performed on the culture supernatants obtained from the parent 
strain and the NCE4 high-expression strains, after which the NCE4 peak was removed and lyophilized. This was dis- 
solved in a small amount of water. After removing the modified N-terminal residues according to the method of Example 
B2, 15 residues of the N-terminal amino acid sequence were determined using the above protein sequencer. The 
resulting sequence was as shown below. 

N-terminal amino acid sequence: Gln-Ala-Gly-Ser-Ala-Asp-Gly-Lys-Ser-Thr-Arg-Tyr-Trp-Asp-(Cys) (15 residues) 

[01 1 1 ] This N-terminal, amino acid sequence coincided with the cellulase NCE1 and NCE4 fused protein amino acid 
sequence estimated from the base sequence of plasmid plED02. 

(3) FPLC Evaluation of plED02 Transformed Strains 

[01 1 2] As was previously described, column chromatography was performed using an FPLC system to further quan- 
tify the culture supernatants of the 5 clones confirmed to express large amounts of NCE4 by SDS-PAGE, and the NCE4 
peak was removed. This was then lyophilized, and the weight was measured and the productivity was compared 
between high expression strains and the parent strain. Those results were as shown in the table below. 



Table 6 





NCE4 Productivity* 


Humicola insolens MN200-1 (parent strain) 
Humicola insolens plED02 


0.28 g 
2.9 g 



* Productivity refers to the amount produced per one liter of culture liq- 
uid. 
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Sequence Listing 

SEQ ID Mo. : 1 

Length: 305 
Type: Amino acid 
Topology: linear 
Molecule type: Protein 
Sequence description: 

Met Arg Ser Ser Pro Leu Leu Arg Ser Ala Val Val Ala Ala Leu Pro 

-20 -15 -io 

Val Leu Ala Leu Ala Ala Asp Gly Lys Ser Thr Arg Tyr Trp Asp Cys 

"5 15 10 

Cys Lys Pro Ser Cys Gly Trp Ala Lys Lys Ala Pro Val Asn Gin Pro 

15 20 25 

Val Phe Ser Cys Asn Ala Asn Phe Gin Arg Leu Thr Asp Phe Asp Ala 

30 35 40 

Lye Ser Gly Cys Glu Pro Gly Gly Val Ala Tyr Ser Cys Ala Asp Gin 

45 50 55 

Thr Pro Trp Ala Val Asn Asp Asp Phe Ala Phe Gly Phe Ala Ala Thr 
60 65 70 75 

Ser He Ala Gly Ser Asn Glu Ala Gly Trp Cy6 Cys Ala Cys Tyr Glu 

80 85 90 

Leu Thr Phe Thr Ser Gly Pro Val Ala Gly Lys Lys Met Val Val Gin 

95 100 105 

Ser Thr Ser Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp Leu Asn 

HO 115 120 

He Pro Gly Gly Gly Val Gly He Phe Asp Gly Cys Thr Pro Gin Phe 

125 130 135 

Gly Gly Leu Pro Gly Gin Arg Tyr Gly Gly He Ser Ser Arg Asn Glu 
140 145 150 155 

Cye Asp Arg Phe Pro Asp Ala Leu Lys Pro Gly Cys Tyr Trp Arg Phe 

160 165 170 

Asp Trp Phe Lys Asn Ala Asp Asn Pro Ser Phe Ser Phe Arg Gin Val 

175 180 185 

Gin Cys Pro Ala Glu Leu Val Ala Arg Thr Gly Cys Arg Arg Asn Asp 
190 195 200 
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10 



15 



20 



30 



40 



Asp Gly Asn Phe Pro Ala Val Gin lie Pro Sar Ser Ser Thr Ser Ser 

205 210 215 

Pro Val Gly Gin Pro Thr Ser Thr Ser Thr Thr Ser Thr Ser Thr Thr 
220 225 230 235 

Ser Ser Pro Pro Val Gin Pro Thr Thr Pro Ser Gly Cya Thr Ala Glu 

240 245 250 

Arg Trp Ala Gin Cys Gly Gly Asn Gly Trp Ser Gly Cys Thr Thr Cys 

255 260 265 

Val Ala Gly Ser Thr Cys Thr Lys lie Asn Asp Trp Tyr His Gin Cys 
270 275 280 

Leu 

SEQ ID No.! 2 

Length: 1257 

Type: Nucleic acid 

Strandness : double 

Topology: linear 

Molecule type: Genomic DNA 

Original source: 

Organism: Humlcola insolene 
Feature i 

Feature Keyi Intron 

Location: 453.. 509 

Method of determining feature: E 
Sequence description: 

AATGACGGGG CAACCTCCCG CCCGGGCCCA ACTCTTGGGT TTGGTTTGAC AGGCCGTCTG 60 
TCTCTTGCGT CCTCTTACTA CGCCTGCCTG GACCCTACGT CTCAACTCCG ATTCAAG 117 
ATG CGT TCC TCC CCT CTC CTC CGC TCC GCC GTT GTG GCC GCC CTG CCG 165 
Met Arg Ser Ser Pro Leu Leu Arg Ser Ala Val Val Ala Ala Leu Pro 

-20 -15 -10 

GTG TTG GCC CTT GCC GCT GAT GGC AAG TCC ACC CGC TAC TGG GAC TGC 213 
Val Leu Ala Leu Ala Ala Asp Gly Lys Ser Thr Arg Tyr Trp Asp Cys 

-5 15 10 

TGC AAG CCT TCG TGC GGC TGG GCC AAG AAG GCT CCC GTG AAC CAG CCT 261 
Cys Lys Pro Ser Cys Gly Trp Ala Lys Lys Ala Pro Val Asn Gin Pro 
15 20 25 



55 
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GTC TTC TCC TGC AAC GCC AAC TTC CAG CGT CTC ACT GAG TTC GAG GCC 309 
Val Phe Ser Cys Asn Ale Asn Phe Gin Arg Leu Thr Asp Phe Asp Ala 

30 35 40 

AAG TCC GGC TGC GAG CCG GGC GGT GTC GCC TAC TCG TGC GCC GAC CAG 357 
Lya Ser Gly Cys Glu Pro Gly Gly Val Ala Tyr Ser Cys Ala Asp Gin 

45 50 55 

ACC CCA TGG GCT GTG AAC GAC GAC TTC GCG TTC GGT TTT GCT GCC ACC 405 
Thr Pro Trp Ala Val Asn Asp Asp Phe Ala Phe Gly Phe Ala Ala Thr 

60 65 70 75 

TCT ATT GCC GGC AGC AAT GAG GCG GGC TGG TGC TGC GCC TGC TAC GA 452 
Ser lie Ala Gly Ser Asn Glu Ala Gly Trp Cys Cys Ala Cys Tyr Gl 

60 85 90 

GTAAGCTTTG GTCGCGTGTG TAACACTGTG CAGGCATAGC ACTAACCACC TCCCAG G 509 

U 

CTC ACC TTC ACA TCC GGT CCT GTT GCT GGC AAG AAG ATG GTC GTC CAG 557 
Leu Thr Phe Thr Ser Gly Pro Val Ala Gly Lys Lys Met Val Val Gin 

95 100 105 

TCC ACC AGC ACT GGC GGT GAT CTT GGC AGC AAC CAC TTC GAT CTC AAC 605 
Ser Thr Ser Thr Gly Gly Asp Leu Gly Ser Asn His Phe Asp Leu Asn 

110 115 120 

ATC CCC GGC GGC GGC GTC GGC ATC TTC GAC GGA TGC ACT CCC CAG TTC 653 
30 He Pro Gly Gly Gly Val Gly He Phe Asp Gly Cys Thr Pro Gin Phe 

125 130 135 

GGC GGT CTG CCC GGC CAG CGC TAC GGC GGC ATC TCG TCC CGC AAC GAG 701 
Gly Gly Leu Pro Gly Gin Arg Tyr Gly Gly He Ser Ser Arg Asn Glu 
35 140 145 150 

TGC GAT CGG TTC CCC GAC GCC CTC AAG CCC GGC TGC TAC TGG CGC TTC 749 
Cys Asp Arg Phe Pro Asp Ala Leu Lys Pro Gly Cys Tyr Trp Arg Phe 
160 165 170 

40 GAC TGG TTC AAG AAC GCC GAC AAC CCG AGC TTC AGC TTC CGT CAG GTC 797 

Asp Trp Phe Lys Asn Ala Asp Asn Pro Ser Phe Ser Phe Arg Gin Val 

175 180 1SS 

CAA TGC CCA GCC GAG CTC GTC GCT CGC ACC GGA TGC CGC CGC AAC GAC 845 
Gin Cys Pro Ala Glu Leu Val Ala Arg Thr Gly Cys Arg Arg Asn Asp 
190 195 200 



10 



15 



20 



25 



45 



50 
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GAC GGC AAC TTC CCT GCC GTC CAG ATC CCC TCC AGC AGC ACC AGC TCT 893 
Aap Gly Asn Phe Pro Ala Val Gin lie Pro Ser Ser Ser Thr Ser Ser 

205 210 215 

CCG GTC GGC CAG CCT ACC AGT ACC AGC ACC ACC TCC ACC TCC ACC ACC 941 
Pro Val Gly Gin Pro Thr Ser Thr Ser Thr Thx Ser Thr Ser Thr Thr 
220 225 230 235 

TCG AGC CCG CCC GTC CAG CCT ACG ACT CCC AGC GGC TGC ACT GCT GAG 989 

Ser Ser Pro Pro Val Gin Pro Thx Thr Pro Ser Gly Cys Thr Ala Glu 

240 245 250 

AGG TGG GCT CAG TGC GGC GGC AAT GGC TGG AGC GGC TGC ACC ACC TGC 1037 
Arg Trp Ala Gin Cys Gly Gly Asn Gly Trp Ser Gly Cys Thr Thr Cys 

255 260 265 

GTC GCT GGC AGC ACC TGC ACG AAG ATT AAT GAC TGG TAC CAT CAG TGC 1085 
Val Ala Gly Ser Thr Cys Thr Lys He Asn Asp Trp Tyr His Gin Cys 

270 275 280 

CTG TAA ACGCAGGGCA GCCTGAGAAC CTTACTGGTT GCGCAACGAA ATGACACTCC 1141 
Leu 

CAATCACTGT ATTJLGTTCTT GTACATAATT TCGTCATCCC TCCAGGGATT GTCACATATA 1201 
TGCAATGATG AATACTGAAC ACAAACCTGG CCGCTTGAAC TGGCCGAAGG AATGCC 1257 



Claims 

1 . A protein or modified protein having a portion of an amino acid sequence shown in SEQ ID No. 1 or a sequence 
from position 1 to position 284 of the amino acid sequence shown in SEQ ID No. 1 . 

2. A protein or modified protein according to claim 1 further comprising a portion or all of the amino acid sequence 
from position -21 through -1 shown in SEQ ID No. 1 at N-terminal thereof. 

3. A protein or modified protein according to claims 1 or 2 having endoglucanase activity. 

4. A DNA sequence coding for the protein or modified protein according to any one of claims 1 to 3. 

5. A DNA sequence according to claim 4, wherein the DNA sequence has a portion or all of the base sequence shown 
in SEQ ID No. 2. 

6. A DNA sequence according to claim 5 having a base sequence from position 1 18 to position 1088 of the base 
sequence shown in SEQ ID No. 2. 

7. A vector comprising the DNA sequence according to any one of claims 4 to 6. 

8. A host cell transformed by the vector according to daim 7. 

9. A method for preparing a protein or modified protein according to any one of claims 1 to 3, comprising the steps of 
culturing the host cell according to claim 8 to obtain a culture, and collecting the protein or modified protein accord- 
ing to any one of claims 1 to 3 from the culture. 

10. A cellulase preparation comprising a protein or modified protein according to any one of claims 1 to 3. 
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1 1 . A method for removing nap of cellulose-containing fibers, the method comprising contacting the protein or modified 
protein according to any one of claims 1 to 3 or the cellulase preparation according to claim 7 with cellulose-con- 
taining fibers. 

5 12. A method for decoloring denim-dyed, cellulose-containing fibers, the method comprising contacting the protein or 
modified protein according to any one of claims 1 to 3 or the cellulase preparation according to claim 7 with denim- 
dyed, cellulose-containing fibers. 

13. A method for reducing cellulose-containing fibers, the method comprising contacting the protein or modified protein 
10 according to any one of claims 1 to 3 or the cellulase preparation according to claim 7 with cellulose-containing fib- 
ers. 

14. Use of the protein or modified protein according to any of claims 1 to 3 or the cellulase preparation according to 
claim 7 for removing nap of cellulose-containing fibers. 

15 

15. Use of the protein or modified protein according to any one of claims 1 to 3 or the cellulase preparation according 
to claim 7 for decoloring denim-dyed, cellulose-containing fibers. 

16. Use of the protein or modified protein according to any one of claims 1 to 3 or the cellulase preparation according 
20 to claim 7 for reducing cellulose-containing fibers. 



30 
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